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Abstract 

While a number of studies have measured multivariate sexual selection acting 
on sexual signals in wild populations, few have confirmed these findings with 
experimental manipulation. Sagebrush crickets are ideally suited to such investi- 
gations because mating imposes an unambiguous phenotypic marker on males 
arising from nuptial feeding by females. We quantified sexual selection operat- 
ing on male song by recording songs of virgin and mated males captured from 
three wild populations. To determine the extent to which selection on male 
song is influenced by female preference, we conducted a companion study in 
which we synthesized male songs and broadcast them to females in choice 
trials. Multivariate selection analysis revealed a saddle-shaped fitness surface, 
the highest peak of which corresponded to longer train and pulse durations, 
and longer intertrain intervals. Longer trains and pulses likely promote greater 
mate attraction, but selection for longer intertrain durations suggests that ener- 
getic constraints may necessitate "time outs". Playback trials confirmed the 
selection for longer train and pulse durations, and revealed significant stabiliz- 
ing selection on dominant frequency, suggesting that the female auditory sys- 
tem is tightly tuned to the species-specific call frequency. Collectively, our 
results revealed a complex pattern of multivariate nonlinear selection character- 
ized primarily by strong stabilizing and disruptive selection on male song traits. 
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Introduction 

Male acoustic signals play an important role in mate 
attraction in a number of taxa including birds (Catchpole 
and Slater 1995), mammals (McComb 1991), anurans 
(Gerhardt and Brooks 2009), and insects (Alexander 
1962; Huber et al. 1989; Robinson and Hall 2002), and 



are a frequent target of sexual selection (Andersson 1994). 
Acoustic signaling can impose high energetic costs on 
males (Prestwich and Walker 1981; Bailey et al. 1993; 
Hoback and Wagner 1997; Hack 1998), and thus song 
may also serve as an honest indicator of male quality 
(Zahavi 1975; Burk 1988). Males that are able to gain 
access to and efficiently convert resources into attractive 
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sexual advertisements are likely to be of superior quality 
and achieve a higher mating success. Females may impose 
directional (linear) sexual selection on males by choosing 
mates that exhibit the greatest calling effort (e.g., Brooks 
et al. 2005; Bentsen et al. 2006; Gerhardt and Brooks 
2009). Additionally, females may also exert stabilizing 
(nonlinear) selection through a preference for males that 
produce song with intermediate structural song character- 
istics and frequencies (e.g., Brooks et al. 2005; Bentsen 
et al. 2006). Males that produce song beyond these inter- 
mediate thresholds might be expected to have lower 
reproductive success because they are not recognized as 
conspeciiics or escape detection by the female auditory 
system. 

While extensive research has focused on how sexual 
selection influences the evolution of acoustic mating sig- 
nals, up until recently, a majority of experimental studies 
have investigated only a single song characteristic at a 
time (e.g., Forrest 1980; Hedrick 1986; Latimer and 
Sippel 1987; Ritchie 1996; Hunt et al. 2006). This is bio- 
logically unrealistic because selection rarely acts individu- 
ally upon a single trait (Lande and Arnold 1983; Blows 
2007). Instead, selection typically acts on combinations of 
traits that collectively affect an individual's reproductive 
success (Lande and Arnold 1983). By only investigating 
single song characteristics in isolation, differences in fit- 
ness that are caused by a combination of traits could 
erroneously be attributed to a single modified trait (Hunt 
et al. 2009). A better method, yielding a more realistic 
view of selection, is the use of multiple regression and 
canonical analysis to measure the direction, shape, and 
strength of selection acting upon a suite of characters 
(Lande and Arnold 1983; Phillips and Arnold 1989). 

Numerous studies on insects have measured multivari- 
ate sexual selection in both the laboratory (e.g., Blows 
et al. 2004; Brooks et al. 2005; Chenoweth and Blows 
2005; Hall et al. 2008; Judge 2010; Gershman et al. 2012) 
and in wild populations (e.g., LeBas et al. 2003, 2004; 
Bentsen et al. 2006; Bussiere et al. 2008; Punzalan et al. 
2010; Robson and Gwynne 2010; Wheeler et al. 2012). 
While these studies have played an important role in 
revealing the complexity of sexual selection (Hunt et al. 
2009), it is also important to recognize some of the limi- 
tations of this approach. First, a limitation common to all 
conventional selection analyses is that this approach is 
correlational and therefore any relationship between a 
trait and fitness may not necessarily be causal. This is 
especially problematic in field studies where a myriad of 
external factors are likely to influence selection (Endler 
1986) and can be circumvented by experimentally manip- 
ulating the sexual traits being examined (e.g., Bentsen 
et al. 2006; Gerhardt and Brooks 2009). However, despite 
earlier calls to do so (Endler 1986), relatively few studies 
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have used experimental manipulation to quantify the 
strength and form of sexual selection (but see Brooks 
et al. 2005; Bentsen et al. 2006; Gerhardt and Brooks 

2009) , particularly to confirm the findings of conventional 
multivariate selection analysis. Second, the biological rele- 
vance of selection gradients depend heavily on the esti- 
mates of fitness used. In most field studies of sexual 
selection in insects, pairing success is used as a proxy for 
fitness (e.g., LeBas et al. 2003, 2004; Bussiere et al. 2008; 
Punzalan et al. 2010; Wheeler et al. 2012) and it is not 
always known whether males found in copula actually 
obtain a successful mating (but see Robson and Gwynne 

2010) . This is almost certainly a consequence of the diffi- 
culty in measuring actual male mating success under nat- 
ural conditions. Sagebrush crickets, Cyphoderris strepitans 
(Orthoptera: Haglidae), offer an excellent model system in 
this regard because successful mating imposes an unam- 
biguous phenotypic marker on males that results from an 
unusual form of nuptial feeding by females (Morris 1979; 
Eggert and Sakaluk 1994). 

The sagebrush cricket, C. strepitans (Orthoptera: Hagli- 
dae), is one of seven known surviving species of hump- 
winged grigs, a lineage that was highly diverse between 
the late Permian to early Cretaceous periods, but is now 
nearly extinct (Kumala et al. 2005). C. strepitans occurs in 
high-elevation sagebrush meadows nestled within conifer- 
ous forests in Wyoming and Colorado (Morris and 
Gwynne 1978). In Grand Teton National Park where the 
majority of field studies of C. strepitans have been con- 
ducted (Sakaluk and Ivy 1999; Sakaluk et al. 2004; Leman 
et al. 2009), sexual activity commences in mid-May. Each 
night of the breeding season, males emerge from the 
ground cover to secure a calling perch in sagebrush or 
lodgepole pine, from where they sing to attract sexually 
receptive females (Snedden and Sakaluk 1992; Snedden 
and Irazuzta 1994). The songs of males have a relatively 
simple structure comprised of trains of varying lengths 
made up of pulses emitted from single wing closures 
(Fig. 1). Once a female has located a male, copulation is 
initiated when the female climbs onto the dorsum of the 
male and ends with the transfer of a spermatophore to 
the female. During copulation, the female feeds on the 
male's fleshy hind wings and ingests hemolymph seeping 
from the wounds she inflicts (Morris 1979; Eggert and 
Sakaluk 1994). The wing-feeding behavior of females pro- 
vides a powerful tool for documenting male mating suc- 
cess in the wild because the mating status of males can be 
easily determined by examining their hind wings for the 
melanized scars resulting from mating (Fig. 2). 

An earlier study, in which males were experimentally 
muted and returned to their natural population, revealed 
that song is essential to male mating success (Snedden 
and Sakaluk 1992). However, the features of male song 
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Figure 1. The five focal song characters represent the primary structural characteristics of sagebrush cricket song. Pulse duration (PD) represents 
a single wing closure and interpulse duration (IPD) is the interval between wing closures. Train duration is the duration of a continuous train of 
pulses and intertrain duration is the interval between trains. Sagebrush cricket dominant frequency (DF) is approximately 13 kHz as revealed by 
the spectrogram. 




Figure 2. Hind wings of nonvirgin males (left) have melanized 
wounds resulting from nuptial feeding of females during mating, 
while virgins have uniformly white, intact hind wings. The forewings, 
used to produce song, were removed to afford a better view of the 
hind wings. 

that are subject to selection, and the extent to which male 
song traits are shaped by sexual acoustic interactions 
between competing males, female mating preferences, or 
both, remain unknown. In the congener, Cyphoderris 
monstrosa, territorial males engage in fierce physical con- 
tests and employ song in the context of territorial signal- 
ing (Mason 1996). In contrast, we have rarely observed 
physical interactions between male C. strepitans (Sakaluk 
et al. 1995), and although males singing in close proxim- 
ity occasionally seem to interact acoustically (G. D. O. 
and S. K. S., pers. obs.), male calling appears to function 
primarily to facilitate female phonotaxis (Snedden and 



Irazuzta 1994). In the present study, we attempted to 
conduct a complete multivariate selection analysis of song 
traits of males by recording songs of virgin and nonvirgin 
males captured from three wild populations, and employ- 
ing multiple regression and canonical analysis to measure 
the direction, shape, and strength of selection acting upon 
this suite of sexual characters (Lande and Arnold 1983; 
Phillips and Arnold 1989). To determine the extent to 
which sexual selection on male song in the wild is influ- 
enced by female song preferences, we conducted a second 
study in which we synthesized male songs designed to 
uncouple correlations between male song traits and to 
capture the full range of phenotypic combinations of song 
parameters, and broadcast these song types to females in 
dichotomous choice trials staged in outdoor arenas. By 
adopting this two-pronged approach, measuring selection 
on males mating in the wild in concert with experimental 
manipulation of male song, we aimed to attain a more 
complete picture of the multivariate sexual selection act- 
ing on male sexual signals than hitherto possible using 
one or the other protocol. 

Methods 

Experiments were conducted in Grand Teton National 
Park, Wyoming, where the high-altitude sagebrush mea- 
dow habitat of sagebrush crickets is abundant. Subjects 
were captured from three populations all found within 
the Snake River basin: Deadman's Bar (43°45'33.91"N, 
110°37'25.12"W), Pacific Creek (43°51'25.67"N, 110°31' 
8.41"W), and Bridger-Teton (43°54'40.56"N, 110°28' 
20.24"W). Captured individuals were transported to the 
University of Wyoming-National Park Service (UW-NPS) 
Research Station (43°56'23.51"N, 110°38'29.06"W). 
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Experiment 1: measuring multivariate 
sexual selection on male song 

Sampling populations 

We measured sexual selection on the songs of male sage- 
brush crickets by capturing and recording the songs of 
virgin and nonvirgin males. Each of the three populations 
was censused binightly in 2009 by capturing males and 
inspecting their hind wings for damage to ascertain their 
mating status (Fig. 2). Once the populations had reached 
a ratio of approximately 50:50 virgin to mated males, 
approximately 200 males were captured. This protocol 
ensured that females were given ample opportunity to 
mate with the most attractive males in the population. 

Captured males were placed into mesh recording 
chambers and provisioned with a diet of sagebrush galls 
and sliced apple. On subsequent evenings, the songs of 
males were digitally recorded directly to a computer for 
at least 2 min using a unidirectional condenser micro- 
phone (BG 4.1; Shure Inc., Niles, IL). On average, males 
were recorded within 2.7 (±1.1 SD) nights of capture. 
Raven Pro 1.4 (Cornell Lab of Ornithology, Ithaca, NY) 
was used to measure the five focal song characters 
(Fig. 1): pulse duration (PD), interpulse duration (IPD), 
train duration (TD), intertrain duration (ITD), and dom- 
inant frequency (DF). 

Because temperature could not be closely controlled 
during the song recordings (15.7 ± 1.6°C SD), each of 
the five focal song characters was regressed against 
temperature, and temperature-adjusted trait values were 
obtained by adding the residuals to the mean expected 
trait value. This was necessary because temperature has 
well-documented effects on sagebrush cricket song and 
behavior (Morris and Gwynne 1978; Sakaluk and 
Eggert 2009,) and accounting for the variation attributable 
to temperature should yield better estimates of the actual 
selection gradients for the song characters being examined. 

Multivariate selection analysis 

Fitness surfaces were estimated using the multiple regres- 
sion-based methodology of Lande and Arnold (1983). 
Mating success of males was scored using a dichotomous 
fitness measure of 0 for virgins and 1 for nonvirgins. As 
suggested by Lande and Arnold (1983), the fitness scores 
were transformed to have a mean of 1 (i.e., fitness was 
made relative) and the focal song characters were trans- 
formed into z-scores with a mean of 0 and a standard 
deviation of 1. To estimate the standardized linear selec- 
tion gradients (/?), a first order multiple regression was fit 
using the standardized song data as the predictor vari- 
ables and relative fitness as the response variable. We used 
a second order quadratic multiple regression model that 
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included all linear, quadratic, and correlational terms to 
estimate the matrix of standardized nonlinear selection 
gradients (y) that describes the curvature of the response 
surface. Quadratic regression coefficients are known to 
underestimate by a factor of 0.5 and we therefore doubled 
the quadratic selection gradients from this model (Stinch- 
combe et al. 2008). 

Interpreting the size and significance of individual y 
coefficients can underestimate the strength of nonlinear 
selection (Phillips and Arnold 1989; Blows and Brooks 
2003). Therefore, we examined the extent of nonlinear 
sexual selection by performing a canonical analysis to 
locate the major axes (i.e., eigenvectors) of the response 
surface (Phillips and Arnold 1989). The double regression 
method of Bisgaard and Ankenman (1996) was used to 
estimate the strength of linear (0 ; ) and nonlinear (A ; ) 
selection along each of the eigenvectors (mi) of the 
response surface. 

As recommended by Mitchell-Olds and Shaw (1987), 
randomization tests (with 10,000 iterations) were used to 
assess the significance of all selection gradients, due to 
nonnormality of the response variable (mating success) 
and four of the five predictor variables (IPD, TD, ITD, 
and DF). Randomization tests were also used to assess the 
significance of linear and nonlinear sexual selection acting 
on the eigenvectors of y. For each permutation, the fitness 
scores were shuffled across cricket calls while the eigen- 
vectors were held constant, which created a null distribu- 
tion with no relationship between the fitness scores and 
eigenvectors (see Lewis et al. 2011 for full details of this 
randomization procedure). Because the eigenvectors were 
held constant, this procedure differed from that of 
Reynolds et al. (2010) in which a new canonical rotation 
is conducted for each iteration (i.e., selection on different 
eigenvectors are tested at each iteration). We used this 
approach as we were interested in whether specific eigen- 
vectors experience more linear and nonlinear sexual selec- 
tion than expected by chance rather than if there was 
significant linear or nonlinear sexual selection occurring 
overall (Chenoweth et al. 2012). 

Fitness surface visualization 

Fitness surfaces were visualized by fitting thin-plate 
splines (Phillips and Arnold 1989; Green and Silverman 
1994) to the major axes identified by canonical analysis. 
Thin-plate splines were estimated using the Tps function 
of the fields package (Furrer et al. 2012) of R (version 
2.13.0, http://www.r-project.org/). The smoothing param- 
eter of the Tps function was set to a value that minimized 
the generalized cross-validation score (Green and Silver- 
man 1994). We plotted the surface in R using both the 
perspective and contour map views. 
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Comparing the strength and form of sexual 
selection across populations 

We tested whether linear, quadratic, and correlational 
selection differed among populations using a sequential 
model building procedure for response surface designs 
(Draper and John 1988; Chenoweth and Blows 2005). 
Because the population selection gradients were not sig- 
nificantly different when traits were standardized and fit- 
ness made relative within populations (see Results 
section), the populations were pooled to achieve greater 
statistical power. In our analysis of pooled data, traits 
were standardized and relative fitness calculated across 
populations. Pooling data across populations brought us 
closer (n = 389) to meeting the recommended sample size 
of 500-1000 desirable for reliably detecting nonlinear 
selection (Kingsolver et al. 2001). 

Validating our measurement of sexual selection in 
the wild 

The selection analysis described above assumes that males 
in the virgin and mated groups had equal opportunities 
to mate, an assumption that may be violated if males 
eclose at different times during the breeding season. If 
mated males are those in the population that eclose earlier 
and hence have a longer period to obtain matings than 
males in the virgin group, then differences in the songs of 
virgin and mated males could simply be an artifact of 
age-related changes in male morphology as opposed to a 
causal factor underlying variation in male mating success. 
Although previous studies of sagebrush crickets emerging 
within field enclosures erected at Deadman's Bar have 
suggested that males in the population become sexually 
active within a few days of each other (Snedden 1996), 
they cannot rule out the possibility that differences 
between virgin and mated males are a result of age-related 
changes (i.e., date of appearance in the population). 
Accordingly, we conducted a mark-recapture study in 
2012 to determine if there is an association between first 
appearance in the population and the likelihood of mat- 
ing by males in the field. 

At the beginning of the breeding season, we established 
a study plot at the Bridger-Teton site. This population 
was chosen because no males had mated at our first pop- 
ulation census. On four of the first 5 days of the study, 
we attempted to capture all of the virgin males calling in 
the study area and any mated males that had been previ- 
ously marked. Each male was marked with a numbered 
tag secured to his pronotum with cyanoacrylic glue. Fluo- 
rescent paint was also applied around the pronotum and 
to the femora to facilitate the recapture of marked indi- 
viduals with portable fluorescent lanterns. Seven days 
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after the population was first censused, the population 
attained a ratio of approximately 1:1 virgin to mated 
males, at which time the study was terminated. We 
recorded data on the date of initial capture and mating 
success for 98 males initially marked as virgins. 

We used failure-time analysis to examine the effect of a 
male's capture date on his time to mating (Allison 1995), 
specifically, a Cox regression as implemented by PROC 
PHREG in SAS (version 9.2, SAS Institute Inc 2010). The 
EXACT option was specified in the model statement to 
handle ties, instances in which different males had the 
same time to mating. This option was employed because 
it assumes that mating times are, in reality, continuous 
and ordered, assumptions that are almost certainly met 
by our data. The analysis revealed no significant effect of 
date of capture on time to mating (Wald ^ 2 = 0.32, 
P = 0.57). We also compared virgin (N = 55) and mated 
males (N = 43) with respect to their date of capture. 
Mated males were first captured significantly earlier than 
virgin males (Student's r-test with equal variances, 
f 96 = 2.07, P = 0.042), but the difference in the time of 
their initial appearance in the population was less than 
half a day (0.49 ± 0.24 days [mean ± SE]). We con- 
clude, therefore, that any differences in the songs of virgin 
and mated males stems from their effect on male mating 
success and not from any age-related effects. 

Experiment 2: measuring female 
preferences for male song 

Measuring sexual selection directly on sagebrush cricket 
song poses a problem, because the very act of mating could 
alter male song. Mating results in at least a short-term 
reduction in calling effort due to the refractory interval 
needed to generate a new spermatophore and to recover 
from the hemolymph loss associated with nuptial hind 
wing feeding (Sakaluk et al. 1987, 2004; Sakaluk and Sned- 
den 1990). Hind wing wounding itself does not appear to 
alter song structure, at least not enough that females are 
able to detect (Sakaluk and Ivy 1999). However, hind wing 
wounding may trigger an immune response resulting in an 
energetically costly immunological trade-off that could 
cause a reduction in calling effort (Leman et al. 2009). 
Because even a short-term reduction in singing by nonvir- 
gin males could have influenced the fitness surfaces from 
Experiment 1, it was necessary to conduct a second experi- 
ment in which we directly measured female preference 
with artificially synthesized songs. 

Construction of artificial songs 

While multivariate selection analysis is designed to handle 
correlated phenotypic data (Lande and Arnold 1983), 
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strong correlations among traits can still be problematic if 
it leads to multicollinearity (Mitchell-Olds and Shaw 
1987). Multicollinearity can result if only certain combi- 
nations of traits are expressed in nature due to an evolu- 
tionary history of persistent selection that removes unfit 
phenotypes from the population (Brooks et al. 2005). 
Estimating selection gradients based on these restricted 
subsets of naturally occurring characters could result in 
biased estimates of selection gradients or even prevent 
them from being detected (Mitchell-Olds and Shaw 1987; 
Brooks et al. 2005), because females may prefer acoustic 
signals that are biophysically impossible for males to pro- 
duce (Ryan and Rand 2003). To address these problems, 
we synthesized artificial signals that were designed to be 
phenotypically uncorrelated (e.g., Brooks et al. 2005; 
Bentsen et al. 2006). 

Using a procedure outlined in Brooks et al. (2005), 100 
artificial songs (henceforth referred to as manipulated 
songs) were created that were specially designed so that 
the five focal song characters (Fig. 1) maintained their 
measured univariate means and standard deviations from 
Experiment 1 (Table 1), but were uncorrelated with one 
another. For each focal song character, random normal 
distributions with a mean of 0 and standard deviation of 
1 were independently drawn using the rnorm function in 
the R statistical package (R Development Core Team 
2012). These randomized z-scores represent the number 
of standard deviations an observation would be from the 
mean song character. To transform the z-scores back to 
the original song character distributions, the z-scores were 
multiplied by the measured song character standard devi- 
ation and added to the measured song character mean 
(Table 1). A composite control song was created by set- 
ting each of the five focal song characters to their mea- 
sured mean values from Experiment 1 (Table 1). 

Songs were synthesized using Adobe Audition 3. To 
avoid any potential effect of female temporal or spatial 
bias in the arenas, the manipulated and control songs 
were assigned random start times (between 0.0 sec and 



1.0 sec), and randomly assigned to the left or right stereo 
channel of the generated WAV sound files. The synthe- 
sized songs were transferred to MiniMee Media Players 
(MEElectronics Inc., Walnut, CA) that were used to play 
the songs in the arenas. 

Female phonotaxis trials 

Females were captured from Deadman's Bar in 2010 and 
brought back to the UW-NPS research station where they 
were placed into cages and fed a diet of Purina Cat 
Chow, bee pollen, and sliced apple. This higher quality 
diet allowed the females to be kept in captivity long 
enough to conduct the trials without their condition 
deteriorating. 

Arena choice trials were conducted at a field site near 
the UW-NPS research station, where there are no naturally 
occurring sagebrush crickets, making it a suitable location 
for conducting phonotaxis trials without bioacoustical 
interference from nonexperimental individuals. Four are- 
nas were built separated by 30.5 m, which was a sufficient 
distance for the dense vegetation of the field site to acousti- 
cally isolate the arenas from one another. The arenas were 
built by hammering rebar into the ground and burying 
stainless steel roof flashing around the perimeter of a 
2.0 m x 0.3 m area. Custom built plywood speaker towers 
were placed at the terminal ends of the arena, which posi- 
tioned the stereo speakers (VS2620; Altec Lansing Inc., 
New York, NY) at the average sagebrush height of 0.52 m 
(Snedden 1996) and within the natural height range at 
which males sing (Sakaluk and Eggert 2009). The speaker 
towers also sheltered the electronics from rain and snow 
during the trials. Because females of many orthopteran 
species have been shown to prefer louder males (Bailey 
et al. 1990), we ensured that each speaker was broadcasting 
at 100 dB measured with a digital sound level meter (33- 
2055; Radioshack Inc., Forth Worth, TX) held 5 cm away, 
the approximate amplitude of natural sagebrush cricket 
song (Morris and Gwynne 1978). 



Table 1. Means and standard deviations of song traits of males captured in the wild. 





Recorded songs 




Manipulated songs 




Composite control 


Mean 


SD 


Mean 


SD 


Pulse duration (msec) 


3.268 


0.480 


3.261 


0.478 


3.266 


Interpulse duration (msec) 


27.698 


2.495 


26.760 


2.694 


27.089 


Train duration (msec) 


929.833 


745.923 


1366.763 


808.311 


950.992 


Intertrain duration (msec) 


5020.21 1 


3749.538 


6740.460 


3723.345 


5222.860 


Dominant frequency (KHz) 


13.055 


0.782 


12.994 


0.794 


13.049 



Manipulated song trait means and standard deviations for synthesized songs broadcast to females in choice trials approximated their respective 
recorded means and standard deviations of wild males, but were generated from independently drawn, randomized univariate normal distribu- 
tions. The composite control was set approximately equal to mean trait values of wild males. 
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To minimize behavioral disturbance, holes were 
molded into the ground of the same dimensions as the 
cages in which females were housed. This ensured that 
once a cage was placed in the hole, the soil inside the 
cage was flush with the soil in the arena so that females 
could climb out of the cages of their own volition with- 
out being disturbed (Snedden and Irazuzta 1994). At the 
start of a trial, the cage lids were carefully loosened so 
that they could gently be removed after the female had 
been given 5 min to acclimate to the manipulated and 
control signals being broadcast from the speakers. Females 
were observed throughout the trials to ensure that they 
were displaying phonotaxis behavior. If a female 
approached within 0.3 m of a speaker that was broadcast- 
ing a manipulated signal, it was scored 1 for attractive (or 
0 for unattractive if she chose the control) and the trial 
was ended. If a female did not choose a signal within 
30 min, the trial was ended and repeated on the subse- 
quent evening until the female made a choice. 

Statistical analysis 

Selection analyses were conducted using the same meth- 
ods as in Experiment 1. However, due to inclement 
weather conditions throughout the entire field season in 
2010, we were unable to capture a sufficient number of 
females (« = 25 in total) to test the 100 synthesized sig- 
nals independently. This necessitated using each female in 
four trials, which created a problem of statistical noninde- 
pendence. To test whether individual female choice biased 
the selection gradients, we used a bootstrap resampling 
procedure following Brooks et al. (2005) in which we 
randomly drew one independent observation from each 
of the 25 females, and calculated the selection gradients 
based on this sample. This process was repeated 10,000 
times, which built independent distributions for each 
selection gradient that could be compared with the 
observed selection gradients from pooled trials. If any of 
the observed gradients were biased by individual song 
preferences of the females, we would expect them to fall 
within the tails of their respective distribution of indepen- 
dent gradients. We calculated P values by summing the 
number of | independent gradients | that were greater than 
their respective |observed gradient| and dividing by 
10,000. None of the P values were significant (P = 0.514- 
0.998), so individual trials involving the same female were 
treated as independent in further analyses. 

We compared the strength and form of sexual selection 
acting on songs of males captured in the field and synthe- 
sized songs broadcast to females in choice trials using the 
same sequential model building procedure used to evaluate 
differences between populations (Draper and John 1988; 
Chenoweth and Blows 2005). This procedure allowed the 
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strength of standardized selection gradients between stud- 
ies to be statistically tested, but it cannot be used to com- 
pare fitness surfaces across studies as canonical analysis 
places each study in its own unique canonical space. 
Therefore, to compare differences in sexual selection across 
the two studies in canonical space we calculated the angle 
between the dominant vectors of selection in both studies 
(rri! for the field study and m 5 for our manipulative study) 
following the protocol outlined in Lewis et al. (2011). We 
calculated the 95% credible interval for this angle using a 
Bayesian approach (R code provided on request to GDO) 
implemented in the MCMCglmm package of R (version 
2.13.0) (Hadfield 2010). 

Results 

Experiment 1: measuring sexual selection 
on male song 

Partial F tests from sequential modeling showed no 
significant difference in linear (-F 5j377 = 0.371, P = 0.869), 
quadratic (_F 5> 3 67 = 0.128, P = 0.986), or correlational 
(-Fio,347 = 1-21, P = 0.283) selection gradients among our 
three study populations, allowing the populations to be 
pooled for greater statistical power. 

Standardized linear, quadratic, and correlational selec- 
tion gradients are presented in Table 2A. There was no 
significant linear selection operating on any song trait. 
There was, however, quadratic selection operating on DF 
and significant positive correlational selection between PD 
and IPD, and between TD and ITD (Table 2A). No other 
correlational selection gradients were significantly differ- 
ent from zero. 

Canonical rotation of the y matrix resulted in three ei- 
genvectors with significant nonlinear sexual selection, two 
with positive eigenvalues (m, and m 2 ) and one with a 
negative eigenvalue (m 5 ) (Table 3A). The dominant 
eigenvector of disruptive selection (mi) was heavily influ- 
enced by negative weightings from TD and ITD, and to a 
lesser extent, a negative weighting from PD (Table 3A). 
The dominant eigenvector of stabilizing selection (m 5 ) 
was heavily loaded by a negative weighting from TD and 
a positive weighting from ITD, and to a lesser extent, a 
negative weighting from IPD (Table 3A). The complex, 
saddle-shaped nature of nonlinear sexual selection along 
these two dominant eigenvectors can be visualized in Fig- 
ure 3. The second eigenvector of disruptive selection (m 2 ) 
was positively weighted by PD and IPD and, to a lesser 
degree, negatively weighted by TD (Table 3A); sexual 
selection along this eigenvector can be visualized in Fig- 
ure 4. There was no significant linear sexual selection (0,) 
along any of the major eigenvectors of the fitness surface 
identified by canonical analysis (Table 3A). 



3596 



© 201 3 The Authors. Ecology and Evolution published by John Wiley & Sons Ltd. 



G. D. Ower ef al. 



Sexual Selection on Male Cricket Song 



Table 2. The vector of standardized linear selection gradients (fi) and the matrix of standardized quadratic and correlational selection gradients 
for the five focal songs characters of (A) songs of males captured in the wild and (B) synthesized songs broadcast to females in choice trials. 





PD 


IPD 


TD 


ITD 


DF 


A. Songs of males captured in 


the wild 










PD 0.092 ± 0.048 


0.092 ± 0.037 










IPD 0.025 ± 0.049 


0.101 ± 0.051* 


0.013 ± 0.026 








TD -0.013 ±0.051 


0.024 ± 0.067 


-0.126 ± 0.084 


0.031 ± 0.023 






ITD -0.010 ±0.052 


0.117 ± 0.066 


0.067 ± 0.055 


0.302 ± 0.134* 


0.105 ± 0.039 




DF 0.064 ± 0.048 


-0.033 ± 0.042 


0.010 ± 0.045 


0.034 ± 0.082 


-0.020 ± 0.063 


-0.076 ± 0.027* 


B. Synthesized songs broadcast to females 










PD -0.017 ±0.106 


0.233 ± 0.084 










IPD 0.091 ± 0.106 


-0.005 ± 0.119 


-0.136 ± 0.083 








TD 0.159 ±0.107 


0.078 ± 0.137 


0.057 ± 0.112 


0.104 ± 0.096 






ITD -0.076 ±0.106 


0.067 ±0.115 


-0.153 ± 0.110 


-0.100 ± 0.141 


0.093 ± 0.088 




DF 0.084 ±0.107 


0.106 ± 0.168 


-0.182 ± 0.130 


0.257 ± 0.111* 


-0.141 ± 0.106 


-0.359 ± 0.074* 



PD, pulse duration; IPD, interpulse duration; TD, train duration; ITD, intertrain duration; DF, dominant frequency. 
Randomization tests: *P < 0.05, **P<0.01, ***P< 0.001. 



Table 3. Linear (0,-) and nonlinear (/,■) selection gradients and the M matrix of eigenvectors from the canonical analysis of y for (A) songs of males 
captured in the wild and (B) synthesized songs broadcast to females in choice trials. 



M 



e, 


h 


PD 


IPD 


TD 


ITD 


DF 


A. Songs of recorded males 














m, -0.014 ±0.056 


0.203 ± 0.060*** 


-0.323 


-0.014 


-0.601 


-0.730 


0.010 


m 2 0.073 ± 0.050 


0.095 ± 0.039* 


0.642 


0.650 


-0.392 


0.025 


-0.105 


m 3 -0.002 ± 0.044 


-0.020 ± 0.024 


-0.538 


0.606 


-0.005 


0.238 


0.535 


m 4 0.089 ± 0.046 


-0.043 ± 0.023 


0.440 


-0.230 


0.101 


-0.263 


0.821 


m 5 0.002 ± 0.051 


-0.153 ± 0.042*** 


-0.014 


-0.397 


-0.689 


0.583 


0.168 


B. Arena female choice trials 














m, 0.159 ±0.100 


0.181 ± 0.075* 


0.539 


0.038 


0.674 


-0.335 


0.375 


m 2 -0.120 ±0.108 


0.134 ± 0.070 


0.693 


-0.265 


-0.205 


0.637 


-0.033 


m 3 0.020 ±0.104 


0.011 ± 0.081 


-0.461 


-0.566 


0.451 


0.426 


0.288 


m 4 0.069 ± 0.106 


-0.060 ± 0.076 


-0.088 


0.649 


0.464 


0.497 


-0.329 


m 5 0.046 ±0.1 14 


-0.299 ± 0.079*** 


-0.093 


0.433 


-0.291 


0.230 


0.817 



Randomization tests: *P < 0.05, **P < 0.01, ***P < 0.001. 



Experiment 2: measuring female 
preferences for male song 

Female choice trials revealed no significant linear selection 
on the five male song characters (Table 2B). There was 
significant negative quadratic selection on DF indicating 
stabilizing selection, but the other standardized quadratic 
selection gradients were not significantly different from 
zero (Table 2B). There was significant positive correla- 
tional selection between TD and DF, but none of the 
other standardized correlational gradients were significant 
(Table 2B). 

Canonical analysis of y identified two eigenvectors with 
significant nonlinear sexual selection, mi and m 5 



(Table 3B). The dominant eigenvector of nonlinear sexual 
selection (m 5 ) was characterized by a negative eigenvalue, 
indicative of stabilizing selection acting along this eigen- 
vector, and was most heavily weighted by DF (Table 3B). 
The remaining eigenvector (mi) had a positive eigenvalue, 
indicative of disruptive sexual selection operating along 
this eigenvector, with the strongest weightings coming 
from positive contributions of PD and TD (Table 3B). 
This mixture of a positive and negative eigenvalues dem- 
onstrates that the fitness surface is best described as a mul- 
tivariate saddle and this can be visualized using thin-plate 
splines in Figure 5. There was no significant linear sexual 
selection (0,) along any of the major eigenvectors of the 
fitness surface identified by canonical analysis (Table 3B). 
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4 3 2 1 0 -1 

M1 



Figure 3. Thin-plate spline (A) perspective view and (B) contour map 
visualization of the two major axes of nonlinear selection (m-i and m 5 ) 
operating on songs of males captured in the wild. Each point on the 
contour plot represents an individual male. 

Partial F tests from sequential modeling showed no sig- 
nificant difference in linear (F 5A77 = 0.764, P = 0.576), 
quadratic (^5,4,57 = 0.974, P = 0.433), or correlational 
(-Fio,447 = 1-474, P = 0.146) selection gradients for songs 
of males captured in the field and synthesized songs 
broadcast to females in choice trials. The angle between 
the dominant eigenvectors of nonlinear sexual selection 
(m l for the field study and m 5 for our manipulative 
study) was 89.94° (95% credible intervals: 69.87°- 
109.60°) suggesting that these dominant eigenvectors are 
almost orthogonal in canonical space. Consequently, 
although our sequential model building approach showed 
little difference in the standardized selection gradients for 




2 



Figure 4. Thin-plate spline visualizations (perspective view only) of 
nonlinear selection operating on songs of males captured in the wild. 
(A) mi and m 2 (B) m 2 and m 5 . 

call parameters across experiments, the large angle 
between the dominant eigenvectors suggests that the mul- 
tivariate combination of call parameters under the strong- 
est sexual selection differs significantly across our 
experiments. 

Discussion 

Our field study revealed significant nonlinear sexual selec- 
tion on song traits of male sagebrush crickets captured 
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2 10-1 
M1 



Figure 5. Thin-plate spline (A) perspective view and (B) contour map 
visualization of the two major axes of nonlinear selection (m-, and m 5 ) 
operating on synthesized songs broadcast to females in choice trials. 
Each point on the contour plot represents a synthetic song. 

from wild populations. Specifically, there was significant 
disruptive selection acting on rri! and significant stabiliz- 
ing selection on m 5 , the two major axes of nonlinear 
selection, resulting in the saddle-shaped selection surface 
depicted in Figure 3. The major feature of this rotated 
fitness surface is a rising ridge that peaks at high values 
of m 1; corresponding to longer TDs and PDs, and longer 
intertrain intervals. Longer trains and pulses may be 
favored because the increased sound output they repre- 
sent leads to increased mate attraction, but the fact that 
longer ITDs are also favored suggests an energetic 



constraint. Males capable of producing sustained call 
trains may have to take longer "time outs" to recover 
from these bouts of intense energy expenditure. Such a 
trade-off is also evidenced by the positive correlational 
selection acting on TD and ITD. This interpretation is 
further bolstered by the results of the playback experi- 
ment, the analysis of which also revealed significant non- 
linear selection on male song. Just as in the field results, 
disruptive selection on one of the major axes of nonlinear 
selection (mi) and stabilizing selection on another (m 5 ) 
produced a saddle-shaped fitness surface with a rising 
ridge peaking at high levels of m, (Fig. 5). High values of 
mj in playback trials also correspond to longer train and 
PDs, but in contrast to field results, shorter ITDs. In 
effect, high values of durations m, represent greater 
sound output and calling effort and, consistent with 
previous studies, such songs may be favored because they 
result in greater passive attraction of females, are actively 
preferred by females, or both (Holzer et al. 2003; Brooks 
et al. 2005; Bentsen et al. 2006). However, songs used in 
the playback experiment were deliberately constructed so 
that different song traits were randomly selected to cap- 
ture the entire range of phenotypic combinations of song 
parameters, even those that may be biologically implausi- 
ble. Thus, although songs with longer train (and pulse) 
durations and shorter intertrain intervals may confer the 
highest fitness, there may be few, if any males capable of 
optimizing both song parameters owing to energetic con- 
straints or biomechanical limitations imposed by their 
forewings. Under these circumstances, the best male strat- 
egy may be to produce longer TDs at the cost of having 
to take slightly longer respites from calling. 

In addition to m 1; a second minor axis of disruptive 
selection emerged from our field study, m 2 , high values 
of which correspond to long PDs and increased IPDs, 
and to a lesser extent, shorter TDs. Inspection of the fit- 
ness surfaces in Figure 4 suggests that maximal fitness is 
attained at high levels of m 2 . Thus, longer PDs appear to 
be favored at the expense of increased time between 
pulses. Longer PD (and chirp rate) has been shown to be 
associated with increased oxygen consumption in the var- 
iable field cricket, Gryllus lineaticeps (Hoback and Wagner 
1997). If, in fact, longer train and PDs are more energeti- 
cally costly, this would be consistent with the theory that 
energetically costly signals are attractive to females 
because they are honest indicators of male quality (Zahavi 
1975; Burk 1988; Welch et al. 1998). Indeed, the saddle- 
shaped fitness surfaces obtained in both the field and 
playback studies appears consistent with signaling theory 
(Getty 1998), which posits that attraction of females at a 
greater than linear rate per unit investment in the signal 
enforces the honesty of the signal. An alternative explana- 
tion is that the ordinary least square regression-based 
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approach of Lande and Arnold (1983) predisposes the 
fitness surface to take this saddle-shaped form (Shaw and 
Geyer 2010). More work is needed, however, to test 
between these alternatives (Hunt and Sakaluk in press). 

Examination of the standardized gradients from the 
field study suggests that dominant carrier frequency is 
subject to stabilizing sexual selection (Table 2B). This 
finding was reinforced by canonical analysis of y from 
playback trials. Specifically, there was significant stabiliz- 
ing selection acting on m 5 , the most dominant eigenvec- 
tor of nonlinear sexual selection, which was most heavily 
weighted by DF. This suggests that the auditory system of 
females is most sensitive to the DF of male song, 
13.0 kHz (±0.6 kHz). The auditory system of C. strepi- 
tans has not been characterized, but hearing in the conge- 
ner C. monstrosa is based on a primitive two channel 
hearing system with auditory receptors of two types: low- 
tuned receptors most sensitive to 2 kHz signals and 
broadly tuned receptors that respond to a full range of 
frequencies up to (and possibly above) 20 kHz, including 
the 12 kHz DF of C. monstrosa song (Mason et al. 1999). 
Differential activation of these two auditory receptor 
types allows C. monstrosa to crudely discriminate between 
high and low frequency signals, but they are apparently 
incapable of discerning fine frequency differences 
between conspecific signals (Mason et al. 1999). In con- 
trast to C. monstrosa, selection for intermediate dominant 
frequencies resulting from female phonotactic preferences 
suggests that C. strepitans has a more fine-tuned auditory 
system that is capable of perceiving frequency differences 
among signals of competing males. The quadratic selec- 
tion gradient for DF was —0.359 (Table 2B), which is 
more than three times stronger than the typical range of 
values between —0.1 and 0.1 reported in many previous 
studies (Kingsolver et al. 2001). 

Although our sequential model building approach 
showed little difference in the standardized selection 
gradients for call parameters across the field study and 
the playback experiment, the large angle between the 
dominant eigenvectors suggests that the multivariate com- 
bination of call parameters under the strongest sexual 
selection in the field study differs significantly from that 
of the playback study. Selection documented in the play- 
back study can only be attributed to female song prefer- 
ences because in playback trials, synthetic signals were 
used, thus completely eliminating any influence of male- 
male signaling interactions. In contrast, the pattern of 
selection revealed in the field study could, in theory, be 
attributed to signaling interactions between males occur- 
ring in the context of territorial disputes, female song 
preferences, or both. As noted earlier, acoustic interac- 
tions between neighboring males do occur in C. strepi- 
tans, but we do not know which songs traits are 



important in resolving acoustic contests. In house crick- 
ets, temporal aspects of males' aggressive songs are corre- 
lated with their ability to win fights (Brown et al. 2006). 
Similarly, certain features of male sagebrush cricket song 
may be important in mediating territorial disputes, but if 
that were the case, song traits favored in the context of 
male-male competition might differ from those favored 
by female mating preferences. Thus, the difference in the 
pattern of multivariate sexual selection revealed in the 
two studies may be due to selection imposed through 
male-male signaling interactions in the field. Our study 
also demonstrates the utility of combining measurements 
of selection in unmanipulated wild populations with stud- 
ies in which the male trait of interest is experimentally 
manipulated. Although playback trials revealed that 
females prefer song traits closely aligned with greater 
sound output and calling effort, the pattern of nonlinear 
selection documented for field-collected males suggests 
that males in nature face energetic constraints that physi- 
cally preclude them from producing the phenotypically 
most attractive trait combinations. Thus, studies focusing 
solely on the manipulation of single song parameters in 
isolation may result in misleading inferences concerning 
the form and strength of selection acting collectively on 
traits. 

It is worth noting that neither the field study, nor 
experimental playback trials, detected any significant 
directional selection acting on any of the song traits, in 
contrast to previous studies of field crickets (Holzer et al. 
2003; Brooks et al. 2005; Bentsen et al. 2006), but similar 
to a study by Hall et al. (2008) examining postcopulatory 
sexual selection in the Australian field cricket Teleogryllus 
commodus. Thus, had our study focused on strictly on 
linear selection, we would have concluded erroneously 
that sexual selection does not act on male song structure. 
While many studies have measured selection acting on 
sexual signals in insects, it has only been more recently 
that studies have conducted a complete multivariate selec- 
tion analysis, including canonical rotation, to examine the 
strength and form of sexual selection that female mating 
preferences impose on males (review in Hunt and Sakaluk 
in press). Those that have invariably have revealed a com- 
plex pattern of selection on male sexual traits that, with 
one exception (Brooks et al. 2005), have produced fitness 
surfaces characterized by the multivariate saddle of the 
kind documented in the present study (Blows et al. 2003; 
Bentsen et al. 2006; Hall et al. 2008; Judge 2010; Gersh- 
man et al. 2012; Hunt and Sakaluk in press). 

No difference was found in linear, nonlinear, or corre- 
lational selection gradients among the three study popula- 
tions. The populations are located relatively close together 
along the Snake River and its tributary, Pacific Creek, 
with only 7.4 km separating the Bridger-Teton and Pacific 
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Creek populations, and 13.6 km between Pacific Creek 
and Deadman's Bar. Because our populations are rela- 
tively close together and share very similar habitats, they 
likely are exposed to similar predation pressure and envi- 
ronmental conditions, which could explain the similar 
fitness surfaces in these populations. There also may be 
undiscovered backcountry sagebrush cricket populations 
that could facilitate sufficient gene flow between popula- 
tions to offset any differences. Alternatively, our study 
populations could be relicts of a much larger population 
that historically inhabited the broader Grant Teton region 
and have not had sufficient time to diverge. The lack of 
differences in sexual selection gradients among popula- 
tions further indicates that female preferences for male 
song likely will not be a driving factor in the future diver- 
gence of these populations, unless female preferences 
change within populations. 

In conclusion, our study, which involved measures of 
male mating success in wild populations along with experi- 
mental manipulation of male song played back to females 
in dichotomous mate choice trials, revealed a complex 
pattern of multivariate nonlinear selection characterized 
primarily by strong stabilizing and disruptive selection on 
male song traits; correlational selection was weak or nonex- 
istent. There was also no evidence of significant directional 
selection on any song characters. Our study cautions 
against the use of strictly manipulative studies of male sex- 
ual signals to measure selection resulting from female 
choice because while such studies can reveal the trait com- 
binations females prefer, selection on male traits in wild 
populations will ultimately reflect physical and energetic 
constraints that may preclude certain phenotypic variants. 
However, the use of such studies in concert with measure- 
ment of selection on sexual signals of males in natural pop- 
ulations can help disentangle the relative contributions of 
male competition and female mate choice to the evolution 
of signal structure. Like Bentsen et al. (2006), we also cau- 
tion against laboratory studies that draw inferences about 
sexual selection based on the manipulation of sexual signals 
in a single dimension, as these will invariably underestimate 
the strength and complexity of sexual selection acting on 
male sexual traits. 
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